Human Action Recognition and Shape Segmentation-Recognition
نویسنده
چکیده
Human Action Recognition. Human action recognition has broad range of applications such as video search, sports analysis, human robotics interactions, and health care. Our work is organized in two directions: 1) detailed pixel-level ‘motion and pose’, focusing on close interactions among people; 2) action recognition focusing on goal oriented motion, simplified as ‘action = motion + intention’. In “Detecting Unusual Activity in Video” (cvpr2004), we demonstrated that using large amount of un-labeled video data and a robust graph co-clustering approach, one can uncover visual patterns of un-usual and usual actions. This was an exciting discovery, as it suggested that big-data can solve this hard vision problem without explicitly defining action categories, and without detailed analysis of human motion. Through more experiments, it was clear that such big-data approach has an ‘autistic’ limitation: it memorizes many details, but understands little intricate relationships of human motion and causality among them. It has little ability to make long-range prediction of future actions. My recent work on human action recognition is aimed to resolve this ‘autistic’ limitation. “Action = motion + intention”. Intention as And-Or graph of Actor-Actions: Storyline model. In “Understanding Videos, Constructing Plots: Learning a Visually Grounded Storyline Model from Annotated Video” (cvpr2009), we studied causality among actions. Our Storyline model can be regarded as a stochastic spatio-temporal grammar, whose language (individual storylines) represents potential plausible explanations of new videos in a domain. The basic insight is that all the variations of an event share a goal directed sequence of actions (akin to how movies of the same genre has similar flow of story subplots). The variations, such as one falls down as he runs towards a car, are due to different effects of each action. Our method requires only short video segments accompanied by a text description of the actors and actions present in the video. The system requires no detail annotations of the actor and actions in the video. We model the storyline grammar as a probabilistic AND-OR graph. The Storyline inference
منابع مشابه
Human action segmentation and recognition via motion and shape analysis
In this paper, we present an automated video analysis system which addresses segmentation and detection of human actions in an indoor environment, such as a gym. The system aims at segmenting different movements from the input video and recognizing the action types simultaneously. Two action segmentation techniques, namely color intensity based and motion based, are proposed. Both methods can e...
متن کاملEvaluation of the Parameters Involved in the Iris Recognition System
Biometric recognition is an automatic identification method which is based on unique features or characteristics possessed by human beings and Iris recognition has proved itself as one of the most reliable biometric methods available owing to the accuracy provided by its unique epigenetic patterns. The main steps in any iris recognition system are image acquisition, iris segmentation, iris norm...
متن کاملA New IRIS Segmentation Method Based on Sparse Representation
Iris recognition is one of the most reliable methods for identification. In general, itconsists of image acquisition, iris segmentation, feature extraction and matching. Among them, iris segmentation has an important role on the performance of any iris recognition system. Eyes nonlinear movement, occlusion, and specular reflection are main challenges for any iris segmentation method. In thi...
متن کاملA New IRIS Segmentation Method Based on Sparse Representation
Iris recognition is one of the most reliable methods for identification. In general, itconsists of image acquisition, iris segmentation, feature extraction and matching. Among them, iris segmentation has an important role on the performance of any iris recognition system. Eyes nonlinear movement, occlusion, and specular reflection are main challenges for any iris segmentation method. In thi...
متن کاملTraining Set of Data Bin for Small Black Pixels Neighborhood Recognition of Each Boundary
We first describe how to “fuzzify” the estimated binary columns to create a [0,1]-valued column. Werefer to this [0,1] -valued column as the soft segmentation column of the noisy spectrogram column.Similarly to the collection of soft segmentation columns as the soft segmentation image, or simply asthe soft segmentation. The band-dependent posterior probability that the hard segmentation columnv...
متن کاملA Tree-Based Approach to Integrated Action Localization, Recognition and Segmentation
A tree-based approach to integrated action segmentation, localization and recognition is proposed. An action is represented as a sequence of joint hog-flow descriptors extracted independently from each frame. During training, a set of action prototypes is first learned based on a k-means clustering, and then a binary tree model is constructed from the set of action prototypes based on hierarchi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013